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Abstract. We consider A^-fold 4-block decomposable integer programs, which 
simultaneously generalize A'^-fold integer programs and two-stage stochastic in- 
teger programs with A' scenarios. In previous work [R. Hemmecke, M. Koppe, 
R. Weismantel, A polynomial-time algorithm for optimizing over N-fold 4- 
block decomposable integer programs, Proc. IPCO 2010, Lecture Notes in Com- 
puter Science, vol. 6080, Springer, 2010, pp. 219-229], it was proved that for 
fixed blocks but variable N, these integer programs are polynomial-time solv- 
able for any linear objective. We extend this result to the minimization of 
separable convex objective functions. Our algorithm combines Graver basis 
techniques with a proximity result [D.S. Hochbaum and J.G. Shanthikumar, 
Convex separable optimization is not much harder than linear optimization, 
J. ACM 37 (1990), 843-862], which allows us to use convex continuous opti- 
mization as a subroutine. 
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We consider a family of nonlinear integer minimization problems over block- 
structured linear constraint systems in variable dimension. The objective is to 
minimize a separable convex objective function / : R" — >■ M, defined as 



with convex functions /i : M K of one variable each. 

Hochbaum and Shanthikumar |13) present a general technique for transforming 
algorithms for linear integer minimization to algorithms for separable convex integer 
minimization. The key ingredients of this transformation technique are scaling 
techniques and proximity results between optimal integer solutions and optimal 
solutions of the continous relaxation. This technique leads directly to polynomial 
time algorithms if all the subdeterminants of the constraint matrix are bounded 
polynomially. 

Of course, this is quite a restrictive hypothesis, but an important corollary of 
this work is a polynomial time algorithm for minimizing a separable convex function 
over systems of inequalities associated with a unimodular matrix. This generalizes, 
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in particular, earlier work of Minoux |14j on minimum cost flows with separable 
convex cost functions. 

An impossibility result on the existence of a strongly polynomial algorithm for 
minimizing a general separable convex function over network flow constraints has 
been shown in [12) . 

In the present paper, we study a certain family of block-structured separable 
convex integer minimization problems over polyhedra, which does not satisfy the 
hypothesis of polynomially bounded subdeterminants. The constraint matrix of 
these problems is N-fold A-block decomposable as follows: 



C D^^''^ 
B A 



for some given N e Z+ and N copies of A, B, and D. This problem type was 
studied recently in |8|. 

A'^-fold 4-block decomposable matrices arise in many contexts and have been 
studied in various special cases, three of which are particularly relevant. We denote 
by O a zero matrix of compatible dimensions and by • a matrix with no columns 
or no rows. 

(i) For C — ■ and D — ■ we recover the problem matrix (b a) ^^"^ of ^ two-stage 
stochastic integer optimization problem. Then, B is the matrix associated with the 
first stage decision variables and A is associated with the decision on stage 2. The 
number of occurences of blocks of the matrix A reflect all the possible scenarios 
that pop up once a first stage decision has been made. We refer to [11] for a survey 
on state of the art techniques to solve this problem. 

(ii) For B — ■ and C = • we recover the problem matrix {[ ^) of a so-called 
TV-fold integer problem. Here, if we let A be the node-edge incidence matrix of 
the given network and set D to be the identity, then the resulting A^-fold IP is a 
multicommodity network flow problem. Separable convex iV-fold IPs can be solved 
in polynomial time, provided that the matrices A and D are fixed [U [10] . 

(iii) For totally unimodular matrices C, A their so-called 1-sum (qa) totally 
unimodular. Similarly, total unimodularity is preserved under the so-called 2-sum 
and 3-sum composition [211 [H]. For example, for matrices C and A, column vector 
a and row vector b''' of appropriate dimensions, the 2-sum of ( C a ) and {^J^) gives 
( o ) ■ 2-sum oi {q o ) i^s) creates the matrix 



/ C abT abT \ 
[OA O ] 
\0 O A J 



(2) 

which is the 2-fold 4-block decomposable matrix ( ^ ) . Repeated application 
of certain 1-sum, 2-sum and 3-sum compositions leads to a particular family of A'^- 
fold 4-block decomposable matrices with special structure regarding the matrices 
B and D. 

(iv) The general case appears in stochastic integer programs with second order 
dominance relations [4] and stochastic integer multi-commodity flows. See [8 for 
further details of the model as an A^-fold 4-block decomposable problem. To give 
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Figure 1. Modeling a two-stage stochastic integer niulti- 
commodity flow problem as an A^-fold 4-block decomposable prob- 
lem. Without loss of generality, the number of commodities and 
the number of scenarios are assumed to be equal. 



one example consider a stochastic integer multi-commodity flow problem, intro- 
duced in [ini [2D]- Let M integer (in contrast to continuous) commodities to be 
transported over a given network. While we assume that supply and demands are 
deterministic, we assume that the upper bounds for the capacities per edge are un- 
certain and given initially only via some probability distribution. In a first stage we 
have to decide how to transport the M commodities over the given network without 
knowing the true capacities per edge. Then, after observing the true capacities per 
edge, penalties have to be paid if the capacity is exceeded. Assuming that we have 
knowledge about the probability distributions of the uncertain upper bounds, we 
wish to minimize the costs for the integer multi-commodity flow plus the expected 
penalties to be paid for exceeding capacities. To solve this problem, we discretize 
as usual the probability distribution for the uncertain upper bounds into N sce- 
narios. Doing so, we obtain a (typically large-scale) (two-stage stochastic) integer 
programming problem as shown in Figure [TJ Herein, A is the node-edge incidence 
matrix of the given network, / is an identity matrix of appropriate size, and the 
columns containing — / correspond to the penalty variables. 

2. Main results and proof outline 

In j8], the authors proved the following result. 

Theorem 2.1. Let A e Z'*-^^"-^, B e Z'*'^^"^, C G Z-^cxna^ (= Z'^cxn^ he fixed 
matrices. For given N e Z+ let I £ (Z U {-oo})"«+^"'^, u G (Z U {+oo})"«+^"'*, 
b e Z'^c+A^dA^ and let f: K'^b+Nua ^ K « separable convex function that takes 
integer values on '^^b+Nua ^^^^i denote by f an upper bound on the maximum 
of I/I over the feasible region of the N-fold A-block decomposable convex integer 
minimization problem 

(IP)A,ba,uj: min{/(z) : (g2)Wz = b, 1 < z < u, z€Z"-+^"-}. 

We assume that f is given only by a comparison oracle that, when queried on z and 
z' decides whether /(z) < /(z'), /(z) = /(z') or /(z) > /(z'). Then the following 
holds: 
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(a) There exists an algorithm with input N , 1, u, b that computes a feasible solution 
to (IP)Af,b,i.u./ or decides that no such solution exists and that runs in time 
polynomial in N and in the binary encoding lengths (1, u, b) . 

(b) There exists an algorithm with input N , 1, u, b and a feasible solution zq to 
(IP)Ar.b,i,u,/ thcit decides whether zq is optimal or finds a better feasible solu- 
tion zi to the problem (IP)iv.b,i.u./ with /(zi) < /(zq) and that runs in time 
polynomial in N and in the binary encoding lengths (1, u, b, /) . 

(c) For the restricted problem where f is linear, there exists an algorithm with input 
N, 1, u, b that finds an optimal solution to the problem (IP)Ar,b,i,u,/ or decides 
that (IP) Ar,b,i,u,/ is infeasible or unbounded and that runs in time polynomial 
in N and in the binary encoding lengths (1, u,b, /). 

This theorem generahzes a similar statement for A^-fold integer programming 
and for two-stage stochastic integer programming. In these two special cases, one 
can even prove claim (c) of Theorem 12.11 for all separable convex functions and 
for a certain class of separable convex functions, respectively. In [S], it was posed 
as an open question whether Theorem 12.11 can be extended, for the full class of 
N-lold 4-block decomposable problems, from linear / to general separable convex 
functions /. 

In the present paper, we settle this question, proving the following result for 
separable convex functions /, for which we assume that the following approximate 
continuous convex optimization oracle is available: 

Problem 2.2 (Approximate continuous convex optimization). Given the data A, 
B, C, D, N, 1, u, b and a number e e Q>o, find a feasible solution e Q"B+JVnA 
for the continuous relaxation 

(CP)^,b,i,u./ : min {/(r) : ( g ^ ) (^^r - b, 1 < r < u, r G M"«+^"- } . 
such that there exists an optimal solution f to (CP)jv,b,i,u,/ with 

||f-r£||oo < e, 
or report INFEASIBLE or UNBOUNDED. 

Theorem 2.3. For the problem of Theorem \2.1l we assume that the objective 
function f is given by an evaluation oracle and an approximate continuous convex 
optimization oracle for (CP)Ar^b.i,u,/- 

Then there exists an algorithm that finds an optimal solution to (IP) Af,b,i,u./ or 
decides that (IP) Ar^b.i,u./ is infeasible or unbounded and that runs in time polynomial 
in N and in the binary encoding lengths (1, u, b, /). 

The main new technical contribution of the present paper is to combine Graver 
basis techniques with a proximity result developed by Hochbaum and Shanthiku- 
mar [13] in the context of their so-called proximity-scaling technique. 

This allows us to first use the approximate continuous convex optimization oracle 
to find a point, in whose proximity the optimal integer solution has to lie. The 
integer problem restricted to this neighborhood is then efficiently solvable with 
primal (augmentation) algorithms using Graver bases, which will find the optimal 
integer solution in a polynomial number of steps. 

We now briefly explain the Graver basis techniques; we refer the reader to the 
survey paper [TB] or the monograph [T7] for more details. Let E e Z*^^" be a 
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matrix. We associate with E a finite set G (E) of vectors with remarkable properties. 
Consider the set ker(£^) n Z". Then we put into G{E) all nonzero vectors v S 
ker(£') n Z" that cannot be written as a sum v = v' + v" of nonzero vectors 
v', v" € ker(i?)nZ" that lie in the same orthant (or equivalently, have the same sign 
pattern in {> 0, < 0}") as v. The set G{E) has been named the Graver basis of E, 
since Graver 5 introduced this set G{E) in 1975 and showed that it constitutes an 
optimality certificate (test set) for the family of integer linear programs that share 
the same problem matrix, E. By this we mean that G{E) provides an augmenting 
vector for any non-optimal feasible solution and hence allows the design of a simple 
augmentation algorithm to solve the integer linear program in a finite number of 
augmentations. 

The augmentation technique can also be used to efficiently construct a feasi- 
ble solution in the first place, in a procedure similar to phase I of the simplex 
algorithm 

More recently, it has been shown in [16] that GiE) constitutes an optimality 
certificate for a wider class of integer minimization problems, namely for those 
minimizing a separable convex objective function over a feasible region of the form 

{z: Ez = h, 1 < z < u, z e Z" }. 

Moreover, several techniques have been found to turn the augmentation algo- 
rithm into an efficient algorithm, bounding the number of augmentation steps 
polynomially. Three such speed-up techniques are known in the literature: For 
0/1 integer linear problems, a simple bit-scaling technique suffices For general 
integer linear problems, one can use the directed augmentation technique [53], in 
which one uses Graver basis elements v G G{E) that are improving directions for 
the nonlinear functions c'^v"'" -I- d^v^, which are adjusted during the augmentation 
algorithm. For separable convex integer problems, one can use the Graver-best aug- 
mentation technique [lO] , where one uses an augmentation vector v that is at least 
as good as the best augmentation step of the form 7g with 7 e Z+ and g G G{E). 

In [5] , the authors found a way to implement the directed augmentation technique 
efficiently for A'^-fold 4-block decomposable integer programs, despite the exponen- 
tial size of the Graver basis. This gives an efficient optimization algorithm for the 
case of linear objective functions, proving [Theorem 2.11 It is still an open question 
whether the Graver-best augmentation technique can be implemented efficiently. 
This would give an alternative proof of [Theorem 2.31 

The paper [5] and the present paper crucially rely on the following structural 
result about G {(b a) '■^'') : which was proved in [8]. 

Theorem 2.4. If A e Z'^^'^''-^ , B e Z'''*^"^, C £ Z'^cxn^^ (= ^dcxn^ yj^-g^ 
matrices, then max | ||v||i : v G ^ ((^ 2) } is bounded by a polynomial in N. 

We note that in the special case of A^-fold IPs, the £i-norm is bounded by a 
constant (depending only on the fixed problem matrices and not on N), and in the 
special case of two-stage stochastic IPs, the ^i-norm is bounded linearly in TV. This 
fact demonstrates that A^-fold 4-block IPs are much richer and more difficult to 
solve than the two special cases. 
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3. Proof of the results 

3.1. Aggregation technique. We will use an aggregation/disaggregation tech- 
nique, which is based on the following folklore fact on Graver bases (see, for exam- 
ple. Corollary 3.2 in [6]): 

Lemma 3.1 (Aggregation). Let G ^ {F t f) be a matrix with two identical 
columns. Then the Graver bases of {F f ) and G are related as follows: 

g{G) = { (u, V, w):vw> 0, (u, v + w) e g{{F f )) } U {±(0, 1, -1)}. 

Thus, the maximum £i-norm of Graver basis elements does not change if we 
repeat columns. 

Corollary 3.2. Let G be a matrix obtained from a matrix F by repeating columns. 
Then 

max{||v||i : v e ^(G) } = max{2, max{ ||v|| i : v e g{F)}}. 

3.2. Bounds for Graver basis elements. Let us start by bounding the £i-norm 
of Graver basis elements of matrices. The following result can be found, for instance, 
in [m Lemma 3.20]. 

Lemma 3.3 (Determinant bound). Let A E Z™'*" be a matrix of rank r and 
let A(^) denote the maximum absolute value of subdeterminants of A. Then 
max{||v||i : v G Q{A)} < (n - r){r + l)A{A). Moreover, A{A) < (^mAf)™, 
where M is the maximum absolute value of an entry of A. 

As a corollary of Lemma [3.31 and the aggregation technique fCoroUarv 13.21) . we 
obtain the following result: 

Corollary 3.4 (Determinant bound, aggregated). Let A £ ^"ix" q matrix of 
rank r and let d be the number of different columns in A and M the maximum 
absolute value of an entry of A. Then 

max{ ||v||i : v e GiA)} < (d - r)(r + 1)(V^M)". 

For matrices with only one row (m = r = 1), there are only 2M -I- 1 different 
columns, and so this bound simplifies to 4M^. However, a tighter bound is known 
for this special case. The following lemma is a straight-forward consequence of 
Theorem 2 in [3]. 

Lemma 3.5 (PPI bound). Let A G Z^^" be a matrix consisting of only one row 
and let M be an upper bound on the absolute values of the entries of A. Then we 
have max{ ||v||i : v G g{A) } <2M - 1. 

Let us now prove some more general degree bounds on Graver bases that we will 
use in the proof of the main theorem below. 

Lemma 3.6 (Graver basis length bound for stacked matrices). Let L G Z''^" and 
let F G Z"^". Moreover, put E -.= {1). Then we have 

max{||v||i : v G g{E)) < max{||A||i : A G ^(F • GiL))} ■ max{||v||i : v G ^(L)}. 

Proof. Let v G GiE). Then v G ker(L) implies that v can be written as a non- 
negative integer linear sign-compatible sum v = ^ A^g, using Graver basis vectors 
Si G G{L). Adding zero components if necessary, we can write v = g{L)\. We now 
claim that v G g{E) implies A G g{F ■ Q{L)). 
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First, observe that v e ker(F) implies Fv = F ■ {g{L)\) = {F ■ Q{L))X = 
and thus, A e ker(i^ • Q{L)). If A ^ Q{F ■ Q{L)), then it can be written as a 
sign-compatible sum A = + with fJ,,u £ ker(i^ • Q{L)). But then 

V = + {g{L)u) 

gives a sign-compatible decomposition of v into vectors Q{L)^i,Q{L)v e ker(£'), 
contradicting the minimality property of v e G{E). Hence, AG Q{F ■ Q{L)). 

From V — -^igi with gj G G{L) and A G Q{F ■ Q{L)), the desired estimate 
follows. □ 

We will employ the following simple corollary. 

Corollary 3.7. Let L G Z''^" and let ai" G Z" be a row vector. Moreover, put 
E := {'^D . Then we have 

max{||v||i : v G g{E)} < (2 • max{|aTv| : v G g{L)} - 1) • max{|lv||i : v G QiL)}. 

In particular, if M :— max{|a(*'| : i — 1, . . . then 

max{||v||i : v G g{E)} < 2nM (max{||v|| i : v G 

Proof. By Lemma |3.6[ we already get 

max{||v||i : v G g{E)} < max{||A||i : A G Gia'^ ■ 0(1))} ■ max{||v|li : v G g{L)}. 

Now, observe that a^ • G{L) is a 1 x -matrix. Thus, the degree bound of 

primitive partition identities. Lemma 13.51 applies, which gives 

max{||A||i : A G ^^(aT • g{L))} < 2 ■ max{|aTv| : v G g{L)} - 1, 

and thus, the first claim is proved. The second claim is a trivial consequence of the 
first. □ 

Let us now extend this corollary to a form that we need to prove [Theorem 2^ 

Corollary 3.8. Let L G Z'^^" and let F G Z"^". Let the entries of F be bounded 
by M in absolute value. Moreover, put E := {f^). Then we have 

max{||v||i : v G g{E)} < {2nMf^-^ (niax{||v||i : v G giL)}f'" . 

Proof. This claim follows by simple induction, adding one row of at a time, and 
by using the second inequality of Corollarv l3.7l to bound the sizes of the intermediate 
Graver bases in comparison to the Graver basis of the matrix with one row of F 
fewer. □ 

In order to give a proof of lTheorem 2.4[ let us consider the submatrix { b a) ■ 
A main result from [111 is the following. 

Theorem 3.9 (Graver basis for stochastic IPs). Let A e Z"*-*^"-* and B e Z'^^''"^ , 
and let g = g {{ b a) '■^'')- There exist numbers ff,^,/? G Z+ depending only on A 
and B but not on N such that the following holds: 

(a) For every N G Z+ and for every v G g, we have ||v||oo < g, *-e., the components 
of V are bounded by g in absolute value. 

(b) As a corollary, ||v||i < [ub + NnA)g for all v G t?. 

(c) More precisely, there exists a finite set X C Z"^ of cardinality \X\ < and for 
each X G X a finite set Q Z"^ of cardinality \Y:x\ ^ "H such that the elements 
w E g take the form v = (x, y-'^, . . . , y"), with x G X and y^, . . . , y" G Y^. 
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Remark 3.10. The finiteness of the numbers g, ^, 77 comes from a saturation result 
in commutative algebra. Concrete bounds on these numbers are unfortunately 
not available. However, for given matrices A and i?, the finite sets X and for 
X S X can be computed using the Buchberger-type completion algorithm in 
section 3.3]. Thus, the numbers g,£,,ri are effectively computable. 

Combining this result with Corollarv l3.8[ we get a bound for the ^i-norms of the 
Graver basis elements of ( § ^ ) 

Proposition 3.11 (Graver basis length bound for 4-block IPs). Let A G Z'^-**^"-*, 
B G Z''^^"^, C G Z''c'^"«, D G Z'^c'xnA 5g ^jj;g„ matrices. Moreover, let M be 
a bound on the absolute values of the entries in C and D, and let g G Z_|_ be the 
number from Theorem \3.9l Then for any N G Z_|_ we have 



2"C 



max{||v||i:vGg((g2)'''0} 

< i2inB+NnA)Mf''-^ (max{||v||i : v G ^ ((b a)*"^^)}) 

< (2(ns + 7Vn^)M)2'^-i (K + NnA)g) 



If A, B, C, D are fixed matrices, then max{||v||i : v G ^ ((^ 2) ^^^)} is bounded 
by 0{N^ '^^^), o, polynomial in N. 



Proof. The first claim is a direct consequence of Theorem l3.9l and Corollarv l3.8l with 

L = ( ^ ;^ ) (^) , = ( c £> ) (W) ^ and £; = ( g 2 ) • The polynomial bound for fixed 
matrices A, B, C, D and varying N follows by observing that ua, nB,dc, M, g are 
constants as they depend only on the fixed matrices A, B, C, D. □ 

The above result has appeared before in ^ ; we included the proof to make the 
present paper more self-contained. We now complement it with a useful alternative 
bound, which is given by the following new result. 

Proposition 3.12 (Alternative length bound for 4-block IPs). Let A G Z'^^^"-^, 
B G Z'^'i^"^, C G Z'^cxriB^ 2? G Z'^cxriA ^j^g^ matrices. Moreover, let M be 
a bound on the absolute values of the entries in C and D, and let g,^,ri G Z+ be 
the numbers, depending on A and B , from Theorem \3.9[ Then for any N G Z_|_ we 
have 

max{||v||i:vGe((g2)W)} 

<i-{N + 77)" • dc ■ (v/rfc("s + NnA)My'' ■ {ub + NnA)g. 

If A, B, C, D are fixed matrices, then max{||v||i : v G ^ ((g 2) '■^•')} is bounded 
by 0{N'^'^^'^), a polynomial in N . 

Either of the two results implies ITheorem 2.41 

Remark 3.13. Comparing the two results is difficult because bounds for the finite 
number rj{A, B) are unknown. However, one should expect that the bound of 
[Proposition 3.12 is better for matrices with large upper blocks ( ^ ^ ) , whereas the 



bound of [Proposition 3. IT] is better for matrices with large lower blocks { b a)- 
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Proof of \Proposition 3.1S\ Let L = ( ^ ji ) and F = ( c ^ ) = (C, 1), . . . , D). 
First of all, Theoreni lS. 91 (b) gives the bound 

Ml <{nB+NnA)g for v G ^(L), (3.1) 

where 5 is a constant that only depends on A and B. 

We now consider the matrix F ■ Q{L). Each column of it is given by 

N 

Fw = C^ + DY,r with v = (x,yi,...,y^)Gg(i). 

1=1 

By Theorem 13.91 (c). there are at most ^ = 0(1) different vectors x and for each x 
at most r] — 0(1) different vectors y*. We now determine the number a of different 
sums s = X^fc^i that can arise from these choices. This number is bounded 
by the number of weak compositions of N into r/ non-negative integer parts: a < 
C^^-i^) < (N + v)'' = 0{N''). Thus Fg{L) has at most d := ^ ■ <t < (, ■ {N + 7])"^ = 
0{N^) different columns. 

Using the bound on the entries of C and D, we find that the maximum absolute 
value of the entries of FQ{L) is bounded by (tt-b + NnA)M. 

We now determine a length bound for the elements X oi G{F ■ Q{L)). By Corol- 
lary [3^ we find that 

||A||i <d-dc- [ydc{nB + NnA)Mj 

< ^ (A^ + 7^)" • dc • (Vd^{nB + NnA)M^ ■ (3.2) 

Combining the two bounds p.ip and p. 21) using Corollary 13.61 then gives the 
result. □ 

3.3. Constructing a feasible solution. For constructing a feasible solution to 
the problem (IP)Ar,b,i,u,/, we will use the algorithm of Theorem 12. U fa), first intro- 
duced in [8]. For sake of completeness, we describe the algorithm here and thus 
give the proof of lThcorcm 2.11 (a). 

Proof of VTheorem 2.1\ (a). Let TV G Z+, l,u G b e Z'^c+A'dA, pij-st, 

construct an integer solution to the system (§2) ^^^z = b. This can be done in 
polynomial time using the Hermite normal form of ( § 2 ) • Then we turn it 
into a feasible solution (satisfying 1 < z < u) by a sequence of at most 0{NdA) 
many integer linear programs (with the same problem matrix ( § ^ ) ^^''i with 
bounds 1, u adjusted so that the current solution is feasible) with auxiliary objective 
functions that move the components of z into the direction of the given original 
bounds 1, u, see [7]. This step is similar to phase I of the simplex method in linear 
programming. 

In order to solve these auxiliary integer linear programs with polynomially many 
augmentation steps, we use the speed-up provided by the directed augmentation 
procedure |23) . This procedure requires us to repeatedly find, for certain vectors c 
and d that it constructs, an augmentation vector v with respect to the (separable 
convex) piecewise linear function /i(v) = c^v"*" -I- d^v". 

Consequently, we only need to show how to find, for a given solution zg that 
is feasible for (IP)jv b 1 a augmenting Graver basis element v G ^ (( ^ ^ ) ^^^) 
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for a separable convex piecewise linear function /i(v) in polynomial time in N and 
in the binary encoding lengths of zq and of c, d. 

Let us now assume that we are given a solution zq = (xo,yJ, . . . ,y^) that is 
feasible for (IP)jvbTu/i ^^'^ that we wish to decide whether there exists another 
feasible solution zi with h{zi — zq) < 0. By [S1[T2], it suffices to decide whether 
there exists some vector v = (x, y^, . . . , y^) in the Graver basis of ( § ^ ) '"^^ such 
that Zq + V is feasible and h{v) < 0. By [Proposition 3.11| or [Proposition 3.12[ the 
^i-norm of v is bounded polynomially in N. Thus, since is constant, there is only 
a polynomial number of candidates for the x-part of v. Since the bounds given by 
[Proposition 3.1l[ and [Proposition 3.12[ are effectively computable (cf. lRemark 3.10p . 
we can actually list all possible vectors x that satisfy these bounds. 

For each such candidate x, we can find a best possible choice for yi, . . . , y^v by 
solving the following iV-fold IP: 



(g:)(^'x, 

> 



for given zq = (xo,yj!j, . . . ,y^) and x. As shown in the second line, this problem 
does indeed simplify to a separable convex iV-fold IP with problem matrix ( ^ 2 ) 
because Zq = (xq, yj, . . . , y^ ) and x are fixed. Since the matrices A and D are fixed, 
each such iV-fold IP is solvable in polynomial time [10]. In fact, as shown in [9], 
because the function h is "2-piecewise affine" , this problem can be solved in time 
0{N^L) by Graver-based dynamic programming, where L — (c, d, 1, u, Zq, x). 

If the iV-fold IP is infeasible, there does not exist an augmenting vector using 
the particular choice of x. If it is feasible, let v = (x,yi^ . . . ,y^) be the optimal 
solution. Now if we have /i(v) > 0, then no augmenting vector can be constructed 
using this particular choice of x. If, on the other hand, we have /i(v) < 0, then v 
is a desired augmenting vector for zq and we can stop. 

As we solve polynomially many polynomially solvable iV-fold IPs, one for each 
choice of x, an optimality certificate or a desired augmentation step can be com- 
puted in polynomial time and the claim follows. □ 

3.4. Using Hochbaum Shanthikumar's proximity results. Hochbaum and 
Shanthikumar [13j present an algorithm for nonlinear separable convex integer 
minimization problems for matrices with small subdeterminants. The algorithm 
is based on the so-called proximity-scaling technique. It is pseudo-polynomial in 
the sense that the running time depends polynomially on the absolute value of 
the largest subdeterminant of the problem matrix. The results of the paper [13] 
cannot be directly applied to our situation, since the subdeterminants of iV-fold 
4-block decomposable matrices typically grow exponentially in TV. In the following 
we adapt a lemma from [13] that establishes proximity of optimal solutions of the 



(§5)W (zo-fv)=b, 
min ^ /i (v) : 1 < (zo + v) < u, 

v= (x,yi,...,y^) e 



mm < 



1-zo < 




< U - Zq, 



yi,...,y^ e Z"^ 
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integer problem and its continuous relaxation; we do not use the scaling technique, 
however. 

We consider the separable convex integer minimization problem 

min{ /(z) -.Ez^h, 1 < z < u, z G Z" }. (3.3) 

Theorem 3.14 (Proximity). Let f be an optimal solution of the continuous relax- 
ation of (|3.3p . 

min{ /(r) : Er^h, 1 < r < u, r e R" }. (3.4) 

Then there exists an optimal solution z* of the integer optimization problem (|3.3p 
with 

||f-z*||^ <n-max{||v||eo ■■veg{E)}. 

We remark that we actually just need a bound on the circuits of E, which form a 
subset of the Graver basis of E. Hochbaum and Shanthikumar }13j prove a version 
of this result where the maximum of the absolute values of the subdeterminants 
of E appears on the right-hand side. Our proof is almost identical, but we include 
it here for completeness. 

Proof. Let z be an optimal solution of the integer optimization problem p.3p . 
Since z is a feasible solution to the continuous relaxation, there exists a conformal 
(orthant-compatible) decomposition of f — z into rational multiples of the circuits 
of 

n 

f - z = ^ aiu\ ai > 0, e C{E), 

i=l 

where, due to Caratheodory's theorem, at most n circuits are needed. Then 

n n 

f - z = ^ L"d + ^ Au*, 

i=l i=l 

setting Pi — ai — [ai\ . Now we define 

n n 

r* = z + ^/?iu'', and z* = z + ^ [aij u'. 

2=1 i=l 

Since the vectors u* lie in the kernel of matrix E, both z = z* and z ~ r* satisfy 
the equation Ez = b. Moreover, since both f and z lie within the lower and upper 
bounds and the vectors u* lie in the same orthant as f — z, also z* and r* lie within 
the lower and upper bounds. Thus, r* is a feasible solution to the continuous 
relaxation of (|3.3I) . Since z* is also an integer vector, it is a feasible solution to the 
integer optimization problem p.3p . 
We can write 

f — z = [r* — z] + [z* — z]. 

Then we use an important superadditivity property of separable convex functions 
(see [131 Lemma 3.1] and [H]), which gives 

/(f) - /(z) > [/(r*) - /(z)] + [/(z*) - /(z)], (3.5) 

or, equivalently, 

/(f)-/(r*)>/(z*)-/(z). (3.6) 

Since f is an optimal solution to the continuous relaxation and r* is a feasible 
solution to it, the left-hand side is nonpositive, and so /(z*) < /(z). Thus, since z* 
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is a feasible solution to p.3|) . it is, in fact, another optimal solution of the integer 
optimization problem and /(z*) = /(z). 

We now verify the proximity of z* to f . From the definition of z*, we immediately 
get 



[r-zj + [z-z J||^ 



= ||Er=i"*u'-Er=iLa»Ju'L 

-llELi/^.u'L 

< n ■ max{ Iju-'ljoo : j = l,...,n} 

< n ■ max{ ||v||oo : v G Q{E)} . 

This concludes the proof. □ 

As an immediate corollary, we obtain the following result. 

Corollary 3.15. Let e > and let f he an optimal solution to the continuous 
relaxation (|3.4p . Setting 

1' = max{l, [f- (n •£)!]}, 
u' = min{u, [f + (n • }, 

where I = max { ||v|| oo : v G ^(i?) }, we have 

min{ /(z) :Ez^h, 1 < z < u, z G Z" } 

= min{ /(z) :Ez = h, 1' < z < u', z G Z" }. (3.7) 

Later we will use a simple modification of [Corollary 3.15] using an e-approximate 
optimal solution to the continuous relaxation p.4p . 

For E = ( ^ ^ ) , we can control the size of i using |Proposition 3.1l| or 
[Proposition 3.12] and thus obtain an equivalent IP with small (polynomial-sized) 
bounds. 

We note that though the bounds are small, the dimension is still variable, and 
so the problem cannot be solved efficiently with elementary techniques such as 
dynamic programming. In the following subsections, we show how to solve this IP 
with Graver basis techniques. 

3.5. Graver-best augmentation for the restricted problem. In the restricted 
problem, no long augmentation steps are possible, and therefore it is possible to 
efficiently construct a Graver-best augmentation vector. Using this observation, we 
prove the following theorem. 

Theorem 3.16. Let A G Z'*-^^"-*, B G Z'^'^^"«, C G Z''c.^"«, D G Z'^^'^"^ be 
fixed matrices. Then there exists an algorithm that, given N G Z+, c G '^'^"B+kNuA ^ 
b G Z'^c+Nd.A^ 1, u G ^ a feasible solution Zq, and a comparison oracle for 

the function f: R'^b+a^ma _^ jj^ finds an optimal solution to 

min |/(z) : ( g 2 ) '^'z = b, 1' < z < u', z G Z"«+^"'* } 

and that runs in time that is polynomially bounded in N, in k :— ||u' — I'ljoo; cind 
in the binary encoding lengths (b,c, /). 



GRAVER BASIS AND PROXIMITY TECHNIQUES 



13 



Proof. By the Graver-best speed-up technique [10], it suffices to show that for 
a given feasible solution zq, we can construct a vector 7g, where 7 G Z-|- and 
S & G {{b a) '■^■') ' such that zq + jg is feasible, and 7 and g minimize /(zq + 7g) 
among aU possible choices. It actuahy suffices to construct any vector v such that 
Zq + V is feasible and /(zo + v) < /(zq + 7g). 

Write Zq — (xq, yj, . . . , y^ ) and let v = (x, . . . ) be any vector in the Graver basis 
of ( g 2 ) . By |Proposition 3.11| or [Proposition 3A2\ the £1 -norm of v is bounded 
polynomially in A^. Thus, since ub is constant, there is only a polynomial number 
of candidates for the x-part of v. Since the bounds given by |Proposition 3.11| and 
|Proposition 3.12| are effectively computable (cf. [Remark 3.10|) . we can actually list 
all possible vectors x that satisfy these bounds. 

For each such vector x, we now consider all vectors of the form (7X, y-'^, . . . , y^) 
as candidate augmentation vectors, not just multiples 7V of Graver basis elements. 

In the special case x = 0, this is equivalent to the construction of a Graver-best 
augmentation vector for the iV-fold IP with the problem matrix ( ^ 2) which 
can be done in polynomial time [lOj . 

Otherwise, if x 7^ 0, we determine the largest step length 7 e Z+ such that 
Xq -I- 7X lies within the bounds 1', u'. Certainly 7 < fc. We now check each possible 
step length 7 = 1, 2, . . . , 7 separately. To find a best possible choice for yi, . . . ,yN, 
we solve the following iV-fold IP: 



Since the matrices A and D are fixed, each such A^-fold IP is solvable in polynomial 
time [To] . 

If the A^-fold IP is infeasible, there does not exist an augmenting vector using the 
particular choice of x and 7. If it is feasible, let v = (7X, y^ , . . . , y^) be an optimal 
solution. Now if we have /(v) > 0, then no augmenting vector can be constructed 
using this particular choice of x and 7. If, on the other hand, we have /(v) < 0, 
then V is a candidate for the Graver-best augmentation vector. 

By iterating over all x and all 7, we efficiently construct a Graver-best augmen- 
tation vector. □ 

Remark 3.17. A more precise complexity analysis is as follows. 

(a) For the construction in the special case x = 0: In fact, by [9', Lemma 3.4 and 
proof of Theorem 4.2], for any of the possible step lengths 7 = 1, 2, . . . , fc, we 
can find in linear time 0{N) an augmenting vector 7V that is at least as good 
as the best Graver step 7g with g & G {'. a) Checking all step lengths, we 
get a complexity of 0{kN). 

(b) For the solution of the A^-fold subproblem in the general case x ^ 0: This 
optimization, in turn, uses another Graver-best augmentation technique. In 
Phase I, the possible step lengths are large, but the auxiliary objective func- 
tions are linear, and so the running time is O(iV'^L) by Graver-based dynamic 
programming [9] Theorem 3.9], where L = (1', u', Zq, x). In Phase II, there 
are few possible step lengths, 7 — 1, 2, . . . , fc, so we can try them all. By [9] 
Lemma 3.4 and proof of Theorem 4.2], we can find for a fixed 7 in linear time 
O(A^) an augmenting vector 7V that is at least as good as the best Graver 
step 7g with g £ G {'. a) ■ Checking all step lengths, we get a complexity 
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of 0{kN). Using the results of |T0] (modified with the optimahty criterion of 
[16]), the number of Graver-best augmentations is bounded by 0{N{f)). Thus 
the complexity of this subproblem is 0{N'^k{f) + N^L). 
(c) The number of steps in the overall Graver-best augmentation algorithm for the 
restricted 4-block decomposable problem is again bounded by 0{N{f)). 

Remark 3.18. Other augmentation techniques can be used to prove lTheorem 3.161 
For example, following }13[ section 2], we can reformulate a separable convex inte- 
ger minimization problem with small bounds as a 0/1 linear integer minimization 
problem in the straightforward way. Then we can apply the bit-scaling speed-up 
technique, for instance P^ . 

3.6. Putting all together. For each set of fixed matrices A, B, C, D and for 

any function e{N) that is bounded polynomially in N, we consider the following 
algorithm. 

Algorithm 3.19 (Graver proximity algorithm). 

1: input N e Z+, bounds 1, u e right-hand side b e Z'^^+^'^'^, eval- 

uation oracle for a separable convex function /: ^.^b+Nua _^ jj^ approximate 
continuous convex optimization oracle. 

2: output an optimal solution z* to (IP) A',b,i,u,/ or Infeasible or Unbounded. 

3; Let n — ub + Nua denote the dimension of the problem. 

4; Call the approximate continuous convex optimization oracle with e — e{N) to 
find an approximate solution r^ S Q^b+Nua |-]-^g continuous relaxation 

min{/(r) : (g2)(^V = b, 1 < r < u, reM"^+^"-^}. 

5: if oracle returns Infeasible then 

6: return Infeasible. 

7: else if oracle returns Unbounded then 

8: return Unbounded. 

9: else 

10: Compute an upper bound £ on the maximum ^i-norm of the vectors in 

g ((g 2) ^^'')' using [Proposition 3.11| or [Proposition 3.12| 
11: Let r = max{l, [r^ - (n • € + e)lj} and u' = min{u, Ir^ + {n ■ £ + e)l]}. 

12: Let fc = ||u' - I'lloo- 

13: Using the algorithm of [Theorem 2. U fa), find a feasible solution zg for the 
restricted convex integer minimization problem 

min {/(z) : ( g 2 ) ^^^^ = b, l' < z < u', z G Z"«+^"-* } . 

14: Solve the problem to optimahty using the algorithm of [Theorem 3.161 

By analyzing this algorithm, we now prove the main theorem of this paper. 

Proof of lTheorem 2.31 We first show that [Algorithm 3.19[ is correct. If the contin- 
uous relaxation (CP)Ar^b,i,u,/ is infeasible or unbounded, then so is the problem 
(IP)7V_b,i,u,/- In the following, assume that (CP)jv,b,i,u,/ has an optimal solution. 
Then there exists an optimal solution f to (CP)Ar,b,i,u,/ with ||f — r^\\oo < e. By 
[Theorem 3.14[ there exists an optimal solution z* of the integer optimization prob- 
lem (IP)Ar,ba,u,/ with |jf — z*||(x> < n-£. By the triangle inequality, this solution then 
satisfies \\z* ~ r^Woo < n ■ £ + e and is therefore a feasible solution to the restricted 
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IP with variable bounds 1' and u'. Thus it suffices to solve the restricted IP to 
optimality, which is done with the algorithm of [Theorem 3.161 
The algorithm has the claimed complexity because 

k < 2{{nB + Nua) ■ £ + e) 

is bounded polynomially in N by [Proposition 3.l"T] or [Proposition 3.12} The com- 
plexity then follows from [Theorem 
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